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imm^] mmm 

6 6#B:&VALmi 8 8#B®Gln (Df^^JMLmtl 5—10 O^^-e^^fb-TS 
) Ki-SDNAM^S:-g•t^DNA^;t. 

©■p-^, 4 9 n t ~3 9 8 7 n t (fefc'L. 5 4 3 n t — 6 1 2 n t iCfe 

AGXliC AA(^^yigbfS[tt 1 5 — 1 0 OC^^T^-fbL, ^fe, ZLCOMiic 
I^J(Z)CAAttCAGT'feoTt;J:V^) 1 fB«<Z) D N Al^^-o 

[0 0 0 1] 

:^nmit. ^m^bm^^&2m i&.r. rscA2j ^lv^e)^i:*^fe'5>) (om 

[0 0 0 2] 

scA2ii, A^mjkwmnm^(Di&<Dmmzmm^^^^. nm^i^^m^ 

[0 0 0 3] 

WiT&. • gJ^^^^l/^ft^SSSilE (dentatorubral-pallidoluysian atr 

ophy (DRPLA)) ^ 5 V^T, MHitfe^lCliC AG*Ne>^ 

tS:*^3 7 — 1 0 0IlIT'fe-5<^{C^L, lES^ae^T'tiS 5?f^^T-fe€>o 
[0 0 0 4] 

l^$tlTV^S (Trottier, Y. et al. Nature, 378 , 403-406 (1995))o i^fj^L^ 
*^e>. SCA 2(7)Maa'g^«l^^^tlTfee>T. ^(Z)J^S@H^Jt;^e>*^T'«:fe 
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mmi>mm l. j: e> t-t^mm] 

[0 0 0 5] 

[0 0 0 6] 
[M®S:®g^-rsfe«)<^^©] 

mMm^(Dmm. SCA2S#^C^V^T(Z)^^CAG^U^^ 
hCD^yjIbiS[*'^it^LTV>S. 2. 5 k b(Z)TspEl |l^>t$:MmL. 

CAG h U h(Df^^JML^mti 2-o<Dmmz^ti^tl 

/\^^V ^'^X-t^yi-V X/Hr^ K$:->^n-:/^i bTMv^Tfc h c dna^ 

c DNA^;t$:^ n- — ^i^'b. $ e>icz:(D c dn AElT^T-Sr-^n-:/^ L 
rfliv^Tt: h c DN A^-r:/^ U 'J -— >^L/> 3CD:/n -:/i:/N-r 

U ^JVX-r-g) c DNAE^;t?:^t^^^n-r.>^*b7t„ rtie>(Z>c DNA|^;t<Z)i^ 
SlB^J$:S^:^Lfci::i^.. 3tle>^iSV^^Cfi^LTV^yco 5' 7^^«tK3' 7^ 
5§5{B!l®Mi^®j^^i2^rj?:^^t- -Sfei^C. ^^atCRACE (rapid ampl if icatio 
n of cDNA ends) $:^f^;&V^, $^tC5' {i(Z)^SI5(Dig»K^J$:S^^-r -5 ?ti*lC R 
T-PCR$:^^J5:^^^, S C A 2 ©MHit^^-® c DN A(Z)^M^(DJgSiE^J$:9St 

[0 0 0 7] 

l|166#B;S;V^LI|18 SSBOGln ©^UjgUtSftt 15—10 O^r^T^-fb 
[0 0 0 8] 

lEiB^J&n- K-r-SDNAM^^r-^tfo fe^'b. H 1 6 6#B3fcV^ bill 8 8# 

gcDGin ©^ujgbiiiii 5~i 0 o^raT^fb-r-So ^fe. rt^^yigb^^ 
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, ^^A-trlil 5 — 2 SMT'fey, — S C A 2 .m#T'li 3 5 ~ 1 0 0 MT'^ 

N A S C A 2 (Omm^M^^(D cDNAT'feS:ii: liTIHllifeMlCl^jZS $ tlT V> 
-5 = 

[0 0 0 9] 

:2is:5g^9f^DNA|^;tli. *fgWlCj:tJ^(DJSM^j7&^S!t^$tifeOT% h: h c 
DNA^-r->^^ U -^r^Mh bTMV^^PCR^C cfc ^^tC^o'^i^iPi ^ 

[0 0 10] 

[^»J] 

^ ti ^ ® T' (i i& V ^ „ 

[0 0 1 1 ] 

(1) (GAG) 55 :/n-:/®i^i^ 

CAGySS^-fe?: 5 Om^ti-m^k^^m, ' ^'^m}\^^Wmmm. (dentatorubra 
1-pal lidoluysian atrophy (DRPLA)) it-K^ (Roide, R. et al.. Nature Genet 

6, 9-13 (1994)) $:DRP L A,m^(Z)>^V ^ ^DNAT&^^ifipib, 
^ K^^^^-pT7Blue T (p-2093) □ - :i > ^* L T^o p-2093li. (CAG)^^ 

:&.xj-^(DM^oyy'y y^y ^'mn^^ti, -r^ibt), s'-cac cac cag caa gag g 

AA (CAG)55 '^^'^ '^^'^ CC-3' T^$tl-5i3^U$:-^tf= — )i^CD:tU 

•nyi^ Vi!t^\^^'y>{^-. -T^t)*). 5' -GAG GAG CAG GAA CAG GAA GA-S'jStU^ 
5'- fcf7f-5^>-GGG CGA GAG TTT GCG TGA TG-3' $:M V\T P C R fe. PC 

Rii, 10 mM Tris-HGl, pH 8.3, 50 mM KGl, 1.5 mM MgGlg, 2M N,N,N-SU>1^ 

3 09-3100183 
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;i/^^U>>>, 0.1 mM TTP, 0.1 mM dCTP, 0.1 mM dGTP, 9.25 MBq(D [ a-'^^P]dAT 
P (222 TBq/mmol), 0.5 /ctM •ro<7)#y^^ 0.3 ns<D-f=^7.S. KDNA ( 

p-2093) -BLm .0 U®Taq DNA:d<U^^--^ {B^mM^mcD^mmtWi) ^ 
-^tf^*l 6 1 (DmWL^m^'^X^^fc. fttJlcg 4rT'2^>^^'f4$:4fofe^ 
, 9 4*C, 1^<DW^. 5 4*0, 1 r$>a)T--U 7 21C, 3 :$)-<D#:R$: 3 

0-9--r ^;WfV^, S^JC7 2r;T' 1 0 4^r^#:RS:eofco 
[0 0 12] 

2 0 ytt 1 (DX h \y'ZfYT\fPy^WCS.Vf=.^i\L\:f:-X (Dynabeads M-280, Str 
eptavidin;/;i/i?:c-, Dynal ASft^) ^m^^X-^m.i^kG)^^ -fu-^^mm: 

5 mM Tris-HCl(pH7.5), 0. 5 mM EDTAS-D^IM N a C 1 §:-^t^ ■ 
4 0 >tt 1 (D?§?KT'tf-X$:i$fe#bfeo ^^V^■e5 0 ytt 1(Z)0. IM NaOH^M 

[0013] 

«^em*^9, 2 2, 4 3, 5 1 -e^-sr^ Fn^'> l^-fe-^^^-it'S^lCoi^T 
■9-if>^^*f$:4f o>t^^5, (CAG)55 ^ n - :/(iC A G^K^^fi:*^ 4 3 5 1 

M(7)ite^^^li^<7N-f ^VXLT^*^ 2 2M(7)5te^^i:tttSi:A.hV\-r 
U^VXit-f, 9M®it^^i:^i^<^^-r :7'y ^VXL^*^ofe(K. Sanpei et 
al ., Biochemical and Biophysical Research Communications, Vol.212, No. 2, 
1995, pp.341-346)o -t^J^t)^. 3 CO^ n - 7^$: M V^4^^^, A ^ ; - 

[0014] 

(2) SCA 2itfed^®J^«iH^»m^ 

1I5 1C«, S C A 2«#CD^^IlI?:^^c r(D^^|2I^CfeV^T> ^^ti^mn. 

4 ailE4#^0 9 - 3 1 0 0 1 8 3 
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[0 0 15] 

DNA (#15/ig) $rlO 0U(7)TspEI ( H*H:^K;^®m#^MttK) 

if-^3>Ji, 2.75 X SSPE (1 X SSPE =150 mM NaCl, 10 mM NaH2P04, 1 mM ED 
TA), 50% d>;i/AT^ K, 5x^yj\JWyWL. 100 ng/ml|?6^i^-^3W^DN A:R. 
tK(CAG)55 Zfu-Zf (6 x 10^ cpm/ml) $:-g-tfitMtfi-e 6 2 T' 1 S^^froife 
o /N^r^'U ^*>f if-i^3 0-5SK SDSSr-g-tfl x SSC (150 mM NaCl, 1 5 mM^ 

y - >S:fflV^, Kodak Bio Max MS:7W';i/AiC 1 6^^- 7 Ot^T^T- h^S/::*-^* 

[0016] 

^(Z)^^. ^TO) S C A 2m#lC0VNT(D*i, ■>^D--7'^/N-f ^*>rXLit 
2.5 kbp (^)TspEI 

[0 0 1 7] 

— AC^SCA2*# (ig5cfi(D7) *^ ^ ^ D N A $:^^{C <fc 

*;aaiL. ^® 2 7 o g^rispEi T-yi-fbu. T^*n-j^>^f;i/m^^»{C}^)>ttfc 

c ±fBL;^2.5 kbcDTspEI mfn ^"^^^ ^¥ J v ^ BN Am}i EcoRI "egi 
^LT^AZAPII (Stratagenett$g) ^ ^ - tC ^ O - :^ > L 7^o 
^ ^y^^>r-:/^ U-S:±fB(CAG)55 ■/^-^$:MV^, JikB Lfe A>f ^'^f if- 
C AGM^$:-^t?>!rV ^ ^ ^ n->Tspi*'J#||$ti;to 

[0 0 18] 

>riz-i/3 >l±5 x SSC, 1 X 1 0 %6S^-?dr:^ 20m 

MU h U '^7^, 4 0 0 g/m 1 (Z)tl hJB&^D N A;R.tKTspl::/D 
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0.1 X SSC-0.1% SDScfi-e 5 2TC-e 0. 5^^gfe?tLfco MSJf^;^^ y 

J^$:Kodak Bio Max MS^'W ;i/AlC- 7 O'er? 2 4 ^^r^T— h ^ t^'^f :7 >r 

[0019] 

^fe^ Tsp2(DSmaI-ApaI (630 bp) (Di^SM^i^Wi^l^. CAGS.'^mm 

Sr^tf cfc O l^yt U 5? ^ F F - l (5' -CCC TCA CCA TGT CGC TGA AGC-3' ) 

JScXJ^R - 1 (5'-CGA CGC TAG AAG GCC GCT G-3')$:^^b3t (0 1 #M) = h 
m^&McB-N A^^-r^V- (Stratagene^t^) ^yt V nyl ^ F - 1 

>ii, 6 X ssc, 10 X 0.5% sds, 0.05% n y h y '^A. 

100 ng/mi nm'^)-^m^BNARxf^mmm^v^^^iy:t^v-:fu-^^^ti 

5X:i:' 1 S^^fTofec /N-f >^y ^SJ*-f -fe'-i^H^^. M$:SJ^«>lCO. 
5% SDS ;5:t?0.05% t°Dy >^:^^y "^TA^r-^tfe X SSC tfj-T'S S-CT'O. 5^ 

^gfeTtbfeo itl:i^<Dyn- >^i:7\^-7*y 3?*>rXi-s, 4.0 kb<^ c dn n-> 

Fci t)^m^nr:La fci , ispi^mspzay^mm^i^^^L. itmLf=.t^?>. c 

AG;S^M^i£€l(^^»iS^fJtt. CAG(7)>S^IlIfS:?:l§^^-i^CbT^^fe. ^fe, 
Tspi;g:msp2(7)MI^^*%|giafeU^{CFcl ^tJ^^xE<Z)ffiOm>/t<7)-ff-r X.5.tJf^«$: 
aetc^fo Se^fC, Pel XtiJ^AI^cD^^' y-:^>^*iC j:y#{|t$4xfeW>^$:y 
n- >''^ LTfflV^Ta^c^t h c DN A^-f y - (t htffJig^gK, t: hMif. 
fli, . t hfliS-tJ^h: hfiM#) $:Xd7 y -->iJ^t--5z:i:lCi; y, cDNA^n-> 
Fc2 , Fbl4, B4. C6^t;^C19 ^mMLtz (ia6#M) o Fci ©5' ^^^ril^i" 
■£>;t«?)fC, 5' -RACE-Ready cDNA (Clonetech, Palo Alto, CA, USA) 5: M V^T5' -R 
ACE ?:;ffofc (Frohman, M.A. et al, Proc. Natl. Acad. Sci. USA 85, 8998-9 
002 (1988)) e -f^^-^-R-l^miCDPCRlzm^K y^^V-R-2 (5' 
-CTT GOG GAC ATT GGC AGC C-3' , H 1 #M) 5:11 2 CD P C R tCM V^ fe„ J&fe, 
forward Mc^:?'^ -TV- ^ Lr^iv^-rtl<Z)^-^=foF- 1 (0 1 #M) $:Mv^fe. 
350 bp(Z)5'-RACE^% (5R1) S:pT7Blue T ^ ^ ^ - {C-9-:f ^ O - - > ^* L (pT 

6 ffi|E#¥0 9-3100183 
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7Blue T-vector(5Rl) o 5R1 CD|hF— Fcl, Tspl :S.msp2(DiSSiH^Ji:®:4- 
-A-'^^y ^IC j: »J?t^Lfc. c DN ACD3*' :?K^$:l^^i""5fc*fC, t hflgj*^ 
e>}Sm$tlfcpoly(A)'^inRNA(D 1 itt g $:^Mi:bTfflV^, ^^>fV-F-13 (5' 
-TTC TCT CAG CCA AAG CCT TCT ACT ACC-3' , 0 3 S:>^^>fV-i:UTM 
V^r3'-RACE Sr^T^ofeo ^e>lxfcl300 bp ©3' -RACE Higj (3R1) $:pT7Blue T 
^-lZ^:f^U- — :y^'l^t:. (pTTBlue T-vector(3Rl)), 
[0 0 2 0] 

cDNA©5' m^^^^izm^^r^mz. mm^PCR (RT-PCR) SrtfJfco 
feo i-^&^Dt.. bl hflej*^e>(D;j-- hyi/-*^e»ttmsn7^^RNAS:$fe-r, RNas 
e 7U— ©DNase (Promega ^tSS) "^M^bL/fc (Onodera, 0. et al.. Am. J, H 
um. Geent. 57, 1050-1060(1995)) „ m^^Ki^H. 2 g (D^ R N A il. 2 0 
pmol(D^>'^A^^-tJ-V— $:MV^, 4 2X:-e^T^ofc (Onodera, 0. et al . , gtf 
%) o PCR(D^^-Y-v'-h L/Tli, F1006(5'-TAT CCG CAG CTC CGC TCC C-3* 
> mi0M) S:tKR1002(5'-AGC CGG GCC GAA ACG CGC CG-3\ M 1 0M) $:Mv^ 
fe„ 5pmol-ro(Z)#y^-r 10 mM Tris HCl (pH8.3) , 50 mM KCl, 1.5 mM 

MgClg, 1.7M N,N,N- h U 200 fiTi -fcxDdATF, dCTPjRtf^TTP 

, 100 /iM ©dGTP, 100 /tiM ® 7 - T^Tif dGTP^tJf2.5 U (DTaq ^KU^^-iZ ( 

wmmi^^n^^^r^^. 9 e x: i ^j^r^^^isxs, e 5n^^®T--u 

Xg, 7 2t: 1 5^'f^(Z)#:RXg*^^^^if>r 3 Om^UilL, m^{C7 2 

X:i:5-5^mW^-t^^iHZjzi)PCR^n^£^r~o ^<D'^m. 5R1 ©J:gP.{C246 
bpMlJf-g). ^^n-^SRl^r^ife (S6#M) o 
[0 0 2 1 ] 

^■t^m^^^-to ^fe. SCA2 c DNAcfi®e^^(Z)gP:0'{i=l- K^^5:^ 
■To CAG;K^M^®&gl*M^y ©3^^--/^r;^.-e^$tlTV^S„ TspEl (T) , No 
tl (N), Sac II (S), Sau3AI (Sa) , Sma I (Sm), Eco52I (E52), Apa I (Ap). 
AccI (Ac) , BamHI (B) , Xhol (X), EcoRI (E) S.U^Pst I (P) (DmMmmB&. 

*^5^$^lTv^€>o #c DN A^' n-><Z)-t^>rx®:t)ffig*'Jn>-fe>-9-:7.sc A 2 

7 09-3100183 
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c DN A©TlC5^$tlTVNS, 
[0 0 2 2] 

yt^ Uyt^\^^3L — >'$t — ^:^-t/3y^ (Sanger, F. et al. Proc. Natl. Ac 

ad. Sci. USA 74, 5463-5467(1977); Chen E.Y. et al, DNA 4, 165-170 (1985) 

^<7)igSiH^J?r^^-r-2>:t«?){C, bf^^-^^ffcbfcF-l &tJfRS-l(5'-CCT CGG TGT 
CGC GGC GAG TTC C-3' ) V^ P C R J; U , C AGKUM^ii^ti^ J 5. y 
^^»T.Hr5:ififiLfeo 0.25^M-ro(D#:;''^>rv-, 10 mM Tris HCl (pH8.3), 5 
0 mM KCl, 2.0 mM MgClg, 1.7M N ,N,N- hU U 200 fiJl -f-DCOdH 

TP, 200 ngOy^J^v^DNA. UCDTaq ;j<y>t^--^ (^Mst^fc^) 

[0 0 2 3] 

Jiia^c DN n->(Di^Si2^!J{C^^^, /i<y A-5^-;bS:I§;v\T4 3 5 1 
b p<Z)3>-fe>-y-^ S C A 2 c DN A@e^iJ$:^^L:fe (@E^J#-f- 1 jS-tJ^I^ 1 ~ 4 
, ia6#M) o BH^'JS-t l®4352nt~4367nt iijjf U At^-;!/ 

, ^AaIL^ c DN A^ n->T^^C 1 9, B 4 jSiOf 3 R 1 iCfe V^T, |^^®^ 
Slc^KU A7^-;i/*"#iS-r-2>3i:7b'5S^$tifc„ 
[0 0 2 4] 

^#:4^(Z)CAG;5«#fet5C®M^ 




-F - i;5:tFR- 1 $:MVNT#e)4x:fePCRM%S:^y T;J7 y ;ur^ 



K>^;bm^?acl&ic*^i-t-5 3i:{cj; c AG;Km#^(Z)i5c5:?iW^ t;*^:.. pcr« 

, 10 mM Tris-HCl, pH 8.3, 50 mM KCl, 2.0 mM MgClg, 1.7 M N,N,N-hU>t5^ 
;i/yV>>>, lllKBq(Z)[ a- ^2p]dCTP (111 Tbq/mmol), 30 /tt M dCTP,# 200 
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fiM(Dm?, dGTP;5Lt?TTP, #0.25 >ttM(Z>±ia 2 a!l®0:/^>f 200 ng(D^ 

J ^ ^rDNAM0JC1.25 UCDTaq DNA U ;>l ^ -if 1 0 n 1 (35^?K 

tf'T'fTofco ^-r, 9 sr. 2^^^^fe$:^fofe#, 9 5t:, 1 ^^OD^tt, 6 
Ot;, l^fSOT--U>^7*S:tJf 7 2X3, l^^^(Z)#S;()^^^S-9->r^;i/S:3 2 
m^yaib, «^JC7 2t:, 5^ra(Z)#«$:iff ofco SCA2ae^CD^n-> 
<b$4xfe>?rV ^ ^MJic$:fflV^T#e)tlfc. a>^Cr>i^-^X<Z)C AGS^^feSr-g- 
tfiB^J^^f*- (sequence ladder) $:-9--r Xv- - LTfflV^fc, CAG^^^ 
dgctfiJC C A A $:-g-t;iES^M5^«^^0)^-^lCli, rtl ©C A A#^aG A G 
)5^-y--f XlC-^i;?)fe„ CAGM^*^ft:^L-rv>SSCA2M5i*'B^®^'^, C 

A GM^e^©±fB#AiE^jttc A G^j^(Z)-«?->r x^c^i-g•^?)^*^o feo 

[0 0 2 5] 

4 a® S C A 2 tC^3l-t-5CAG^^#fi[f5C$:M^bfeo j^^S:07lC^ 

[0 0 2 6] 

TX&^(DIZML. SCA 2iie^Tli^T3 5MJ^i±Tfeo:tc ^oT, JifH 
{Ccfc i;2RF^$tl7c c DN Ati, S C A 2(DMEit^^® c DN AX Zi t.ti^m 

[0 0 2 7] 

[ie^j«] 
m^m^ : 1 

i2?U®:R$ : 4 3 6 7 

m^i<Dm : 

TATCCGCACC TCCGCTCCCA CCCGGCGCCT CGGCGCGCCC GCCCTCCG ATG CGC TCA 57 

9 ailE# ¥ 09-3100183 
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Met Arg Ser 

1 

GCG GCC GCA GCT COT CGG AGT CCC GCG GTG GCC ACC GAG TCT CGC CGC 105 
Ala Ala Ala Ala Pro Arg Ser Pro Ala Val Ala Thr Glu Ser Arg Arg 

5 10 15 

TTC GCC GCA GCC AGG TGG CCC GGG TGG CGC TCG CTC CAG CGG CCG GCG 153 
Phe Ala Ala Ala Arg Trp Pro Gly Trp Arg Ser Leu Gin Arg Pro Ala 
20 25 30 35 

CGG CGG AGC GGG CGG GGC GGC GGT GGC GCG GCC CCG GGA CCG TAT CCC 201 
Arg Arg Ser Gly Arg Gly Gly Gly Gly Ala Ala Pro Gly Pro Tyr Pro 

40 45 50 

TCC GCC GCC CCT CCC CCG CCC GGC CCC GGC CCC CCT CCC TCC CGG CAG 249 
Ser Ala Ala Pro Pro Pro Pro Gly Pro Gly Pro Pro Pro Ser Arg Gin 

55 60 65 

AGC TCG CCT CCC TCC GCC TCA GAC TGT TTT GGT AGC AAC GGC AAC GGC 297 
Ser Ser Pro Pro Ser Ala Ser Asp Cys Phe Gly Ser Asn Gly Asn Gly 

70 75 80 

GGC GGC GCG TTT CGG CCC GGC TCC CGG CGG CTC CTT GGT CTC GGC GGG 345 
Gly Gly Ala Phe Arg Pro Gly Ser Arg Arg Leu Leu Gly Leu Gly Gly 

85 90 95 

CCT CCC CGC CCC TTC GTC GTC GTC CTT CTC CCC CTC GCC AGC CCG GGC 393 
Pro Pro Arg Pro Phe Val Val Val Leu Leu Pro Leu Ala Ser Pro Gly 
100 105 110 115 

GCC CCT CCG GCC GCG CCA ACC CGC GCC TCC CCG CTC GGC GCC CGT GCG 441 
Ala Pro Pro Ala Ala Pro Thr Arg Ala Ser Pro Leu Gly Ala Arg Ala 

120 125 130 

TCC CCG CCG CGT TCC GGC GTC TCC TTG GCG CGC CCG GCT CCC GGC TGT 489 
Ser Pro Pro Arg Ser Gly Val Ser Leu Ala Arg Pro Ala Pro Gly Cys 

135 140 145 

1 0 miiE#¥ 09-3100183 
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CCC CGC CCG GCG TGC GAG COG GTG TAT GGG CCC GIG ACC ATG TCG CTG 537 
Pro Arg Pro Ala Gys Glu Pro Val Tyr Gly Pro Leu Thr Met Ser Leu 

150 155 160 

AAG CCC GAG CAG GAG CAG GAG CAG CAG GAG GAG CAG CAG CAG GAG CAA 585 
Lys Pro Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin 

165 170 175 

CAG CAG CAG CAG CAG CAG CAG CAG CAG CCG CCG CCC GCG GCT GCC AAT 633 
Gin Gin Gin Gin Gin Gin Gin Gin Gin Pro Pro Pro Ala Ala Ala Asn 
180 185 190 195 

GTC CGC AAG CCC GGC GGC AGC GGC CTT CTA GCG TCG CCC GCC GCC GCG 681 
Val Arg Lys Pro Gly Gly Ser Gly Leu Leu Ala Ser Pro Ala Ala Ala 

200 205 210 

CCT TCG CCG TCC TOG TCC TCG GTC TCC TCG TCG TCG GCC ACG GCT CCC 729 
Pro Ser Pro Ser Ser Ser Ser Val Ser Ser Ser Ser Ala Thr Ala Pro 

215 220 225 

TCC TCG GTG GTC GCG GCG ACC TCC GGC GGC GGG AGG CCC GGC CTG GGC 777 
Ser Ser Val Val Ala Ala Thr Ser Gly Gly Gly Arg Pro Gly Leu Gly 

230 235 240 

AGA GGT CGA AAC ACT AAG AAA GGA CTG CCT CAG TCT ACG ATT TCT TTT 825 
Arg Gly Arg Asn Ser Asn Lys Gly Leu Pro Gin Ser Thr He Ser Phe 

245 250 255 

GAT . GGA ATC TAT GGA AAT ATG AGG ATG GTT CAT ATA CTT ACA TCA GTT 873 
Asp Gly He Tyr Ala Asn Met Arg Met Val His He Leu Thr Ser Val 
260 265 270 275 

GTT GGC TCC AAA TGT GAA GTA CAA GTG AAA AAT GGA GGT ATA TAT GAA 921 
Val Gly Ser Lys Cys Glu Val Gin Val Lys Asn Gly Gly He Tyr Glu 

280 285 290 

GGA GTT TTT AAA ACT TAG ACT CCG AAG TGT GAT TTG GTA CTT GAT GCC 969 
Gly Val Phe Lys Thr Tyr Ser Pro Lys Cys Asp Leu Val Leu Asp Ala 
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295 300 305 

GCA CAT GAG AAA AGT ACA GAA TCC AGT TCG GGG COG AAA CGT GAA GAA 1017 
Ala His Glu Lys Ser Thr Glu Ser Ser Ser Gly Pro Lys Arg Glu Glu 

310 315 320 

ATA ATG GAG AGT ATT TTG TTC AAA TGT TCA GAG TTT GTT GTG GTA CAG 1065 

He Met Glu Ser He Leu Phe Lys Cys Ser Asp Phe Val Val Val Gin 

325 330 335 

TTT AAA GAT ATG GAG TCC AGT TAT GCA AAA AGA GAT GCT TTT ACT GAC 1113 

Phe Lys Asp Met Asp Ser Ser Tyr Ala Lys Arg Asp Ala Phe Thr Asp 
340 345 350 355 

TCT GCT ATC AGT GCT AAA GTG AAT GGC GAA CAC AAA GAG AAG GAC CTG 1161 

Ser Ala He Ser Ala Lys Val Asn Gly Glu His Lys Glu Lys Asp Leu 

360 365 370 

GAG CCC TGG GAT GCA GGT GAA CTC ACA GCC AAT GAG GAA CTT GAG GCT 1209 

Glu Pro Trp Asp Ala Gly Glu Leu Thr Ala Asn Glu Glu Leu Glu Ala 

375 380 385 

TTG GAA AAT GAC GTA TCT AAT GGA TGG GAT CCC AAT GAT ATG TTT CGA 1257 

Leu Glu Asn Asp Val Ser Asn Gly Trp Asp Pro Asn Asp Met Phe Arg 

390 395 400 

TAT AAT GAA GAA AAT TAT GGT GTA GTG TCT ACG TAT GAT AGC AGT TTA 1305 

Tyr Asn Glu Glu Asn Tyr Gly Val Val Ser Thr Tyr Asp Ser Ser Leu 

405 410 415 

TCT TCG TAT ACA GTG CCC TTA GAA AGA GAT AAC TCA GAA GAA TTT TTA 1353 

Ser Ser Tyr Thr Val Pro Leu Glu Arg Asp Asn Ser Glu Glu Phe Leu 
420 425 430 435 

AAA CGG GAA GCA AGG GCA AAC CAG TTA GCA GAA GAA ATT GAG TCA AGT 1401 

Lys Arg Glu Ala Arg Ala Asn Gin Leu Ala Glu Glu He Glu Ser Ser 

440 445 450 

GCC CAG TAC AAA GCT CGA GTG GCC CTG GAA AAC GAT GAT AGG AGT GAG 1449 
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Ala Gin Tyr Lys Ala Arg Val Ala Leu.Glu Asn Asp Asp Arg Ser Glu 

455 460 465 

GAA GAA AAA TAG ACA GCA GTT GAG AGA AAT TCC AGT GAA CGT GAG GGG 1497 

Glu Glu Lys Tyr Thr Ala Val Gin Arg Asn Ser Ser Glu Arg Glu Gly 

470 475 480 

CAC AGC ATA AAC ACT AGG GAA AAT AAA TAT ATT OCT OCT GGA GAA AGA 1545 

His Ser He Asn Thr Arg Glu Asn Lys Tyr lie Pro Pro Gly Gin Arg 

485 490 495 

AAT AGA GAA GTC ATA TCC TGG GGA AGT GGG AGA CAG AAT TCA CCG CGT 1593 

Asn Arg Glu Val He Ser Trp Gly Ser Gly Arg Gin Asn Ser Pro Arg 
500 505 510 515 

ATG GGC CAG CCT GGA TCG GGC TCC ATG CCA TCA AGA TCC ACT TCT CAC 1641 

Met Gly Gin Pro Gly Ser Gly Ser Met Pro Ser Arg Ser Thr Ser His 

520 525 530 

ACT TCA GAT TTC AAC CCG AAT TCT GGT TCA GAC CAA AGA GTA GTT AAT 1689 

Thr Ser Asp Phe Asn Pro Asn Ser Gly Ser Asp Gin Arg Val Val Asn 

535 540 545 

GGA GGT GTT CCC TGG CCA TCG CCT TGC CCA TCT CCT TCC TCT CGC CCA 1737 

Gly Gly Val Pro Trp Pro Ser Pro Cys Pro Ser Pro Ser Ser Arg Pro 

550 555 560 

CCT TCT CGC TAC CAG TCA GGT CCC AAC TCT CTT CCA CCT CGG GCA GCC 1785 
Pro Ser Arg Tyr Gin Ser Gly Pro Asn Ser Leu Pro Pro Arg Ala Ala 

565 570 575 

ACC CCT ACA CGG CCG CCC TCC AGG CCC CCC TCG CGG CCA TCC AGA CCC 1833 

Thr Pro Thr Arg Pro Pro Ser Arg Pro Pro Ser Arg Pro Ser Arg Pro 
580 585 590 595 
CCG TCT CAC CCC TCT GCT CAT GGT TCT CCA GCT CCT GTC TCT ACT ATG 1881 

Pro Ser His Pro Ser Ala His Gly Ser Pro Ala Pro Val Ser Thr Met 

600 605 610 
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CCT AAA CGC ATG TCT TCA GAA GGG COT CCA AGG ATG TCC CCA AAG GCC 1929 
Pro Lys Arg Met Ser Ser Glu Gly Pro Pro Arg Met Ser Pro Lys Ala 

615 620 625 

CAG CGA CAT CCT CGA AAT CAC AGA GTT TCT GCT GGG AGG GGT TCC ATA 1977 
Gin Arg His Pro Arg Asn His Arg Val Ser Ala Gly Arg Gly Ser He 

630 635 640 

TCC ACT GGC CTA GAA TTT GTA TCC CAC AAC CCA CCC ACT GAA GCA GCT 2025 
Ser Ser Gly Leu Glu Phe Val Ser His Asn Pro Pro Ser Glu Ala Ala 

645 650 655 

ACT CCT CCA GTA GCA AGG ACC ACT CCC TCG GGG GGA ACG TGG TCA TCA 2073 
Thr Pro Pro Val Ala Arg Thr Ser Pro Ser Gly Gly Thr Trp Ser Ser 
660 665 670 675 

GTG GTC AGT GGG GTT CCA AGA TTA TCC CCT AAA ACT CAT AGA CCC AGG 2121 
Val Val Ser Gly Val Pro Arg Leu Ser Pro Lys Thr His Arg Pro Arg 

680 685 690 

TCT CCC AGA CAG AAC AGT ATT GGA AAT ACC CCC AGT GGG CCA GTT CTT 2169 
Ser Pro Arg Gin Asn Ser lie Gly Asn Thr Pro Ser Gly Pro Val Leu 

695 700 705 

GCT TCT CCC CAA GCT GGT ATT ATT CCA ACT GAA GCT GTT GCC ATG CCT 2217 
Ala Ser Pro Gin Ala Gly He He Pro Thr Glu Ala Val Ala Met Pro 

710 715 720 

ATT. CCA GCT GCA TCT CCT ACG CCT GCT AGT CCT GCA TCG AAC AGA GCT 2265 
He Pro Ala Ala Ser Pro Thr Pro Ala Ser Pro Ala Ser Asn Arg Ala 

725 730 735 

GTT ACC CCT TCT AGT GAG GCT AAA GAT TCC AGG CTT CAA GAT CAG AGG 2313 
Val Thr Pro Ser Ser Glu Ala Lys Asp Ser Arg Leu Gin Asp Gin Arg 
740 745 750 755 

CAG AAC TCT CCT GCA GGG AAT AAA GAA AAT ATT AAA CCC AAT GAA ACA 2361 
Gin Asn Ser Pro Ala Gly Asn Lys Glu Asn He Lys Pro Asn Glu Thr 
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760 765 770 

TCA CCT AGC TTC TCA AAA GCT GAA AAC AAA GGT ATA TCA CCA GTT GTT 2409 
Ser Pro Ser Phe Ser Lys Ala Glu Asn Lys Gly He Ser Pro Val Val 

775 780 785 

TCT GAA CAT AGA AAA CAG ATT GAT GAT TTk AAG AAA TTT AAG AAT GAT 2457 
Ser Glu His Arg Lys Gin He Asp Asp Leu Lys Lys Phe Lys Asn Asp 

790 795 800 

TTT AGG TTA CAG CCA ACT TCT ACT TCT GAA TCT ATG GAT CAA CTA CTA 2505 
Phe Arg Leu Gin Pro Ser Ser Thr Ser Glu Ser Met Asp Gin Leu Leu 

805 810 815 

AAC AAA AAT AGA GAG GGA GAA AAA TCA AGA GAT TTG ATC AAA GAC AAA 2553 
Asn Lys Asn Arg Glu Gly Glu Lys Ser Arg Asp Leu He Lys Asp Lys 
820 825 830 835 

ATT GAA CCA AGT GCT AAG GAT TCT TTC ATT GAA AAT AGC AGC AGC AAC 2601 
He Glu Pro Ser Ala Lys Asp Ser Phe lie Glu Asn Ser Ser Ser Asn 

840 845 850 

TGT ACC AGT GGC AGC AGC AAG CCG AAT AGC CCC AGC ATT TCC CCT TCA 2649 
Cys Thr Ser Gly Ser Ser Lys Pro Asn Ser Pro Ser He Ser Pro Ser 

855 860 865 

ATA CTT AGT AAC ACG GAG CAC AAG AGG GGA CCT GAG GTC ACT TCC CAA 2697 
He Leu Ser Asn Thr Glu His Lys Arg Gly Pro Glu Val Thr Ser Gin 

870 875 880 

GGG GTT CAG ACT TCC AGC CCA GCA TGT AAA CAA GAG AAA GAC GAT AAG 2745 
Gly Val Gin Thr Ser Ser Pro Ala Cys Lys Gin Glu Lys Asp Asp Lys 

885 890 895 

GAA GAG AAG AAA GAC GCA GCT GAG CAA GTT AGG AAA TCA ACA TTG AAT 2793 
Glu Glu Lys Lys Asp Ala Ala Glu Gin Val Arg Lys Ser Thr Leu Asn 
900 905 910 915 

CCC AAT GCA AAG GAG TTC AAC CCA CGT TCC TTC TCT CAG CCA AAG CCT 2841 
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Pro Asn Ala Lys Glu Phe Asn Pro Arg Ser Phe Set Gin Pro Lys Pro 

920 925 930 

TCT ACT ACC CCA ACT TCA CCT CGG CCT CAA GCA CAA CCT AGC CCA TCT 2889 
Ser Thr Thr Pro Thr Ser Pro Arg Pro Gin Ala Gin Pro Ser Pro Ser 

935 940 945 

ATG GTG GGT CAT CAA CAG CCA ACT CCA GTT TAT ACT CAG CCT GTT TGT 2937 
Met Val Gly His Gin Gin Pro Thr Pro Val Tyr Thr Gin Pro Val Cys 

950 955 960 

TTT GCA CCA AAT ATG ATG TAT CCA GTC CCA GTG AGC CCA GGC GTG CAA 2985 
Phe Ala Pro Asn Met Met Tyr Pro Val Pro Val Ser Pro Gly Val Gin 

965 970 975 

CCT TTA TAG CCA ATA CCT ATG ACG CCC ATG CCA GTG AAT CAA GCC AAG 3033 
Pro Leu Tyr Pro He Pro Met Thr Pro Met Pro Val Asn Gin Ala Lys 
980 985 990 995 

ACA TAT AGA GCA GTA CCA AAT ATG CCC CAA CAG CGG CAA GAC CAG CAT 3081 
Thr Tyr Arg Ala Val Pro Asn Met Pro Gin Gin Arg Gin Asp Gin His 

1000 1005 1010 

CAT CAG ACT GCC ATG ATG CAC CCA GCG TCA GCA GCG GGC CCA CCG ATT 3129 
His Gin Ser Ala Met Met His Pro Ala Ser Ala Ala Gly Pro Pro He 

1015 1020 1025 

GCA GCC ACC CCA CCA GCT TAC TCC ACG CAA TAT GTT GCC TAC ACT CCT 3177 
Ala Ala Thr Pro Pro Ala Tyr Ser Thr Gin Tyr Val Ala Tyr Ser Pro 

1030 1035 1040 

CAG CAG TTC CCA AAT CAG CCC CTT GTT CAG CAT GTG CCA CAT TAT CAG 3225 
Gin Gin Phe Pro Asn Gin Pro Leu Val Gin His Val Pro His Tyr Gin 

1045 1050 1055 

TCT CAG CAT CCT CAT GTC TAT AGT CCT GTA ATA CAG GGT AAT GCT AGA 3273 
Ser Gin His Pro His Val Tyr Ser Pro Val He Gin Gly Asn Ala Arg 
1060 1065 1070 1075 
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ATG ATG GCA CCA CCA ACA CAC GCC CAG CCT GGT TTA GTA TCT TCT TCA 3321 
Met Met Ala Pro Pro Thr His Ala Gin Pro Gly Leu Val Ser Ser Ser 

1080 1085 1090 

GCA ACT CAG TAC GGG GCT CAT GAG CAG ACG CAT GCG ATG TAT GCA TGT 3369 
Ala Thr Gin Tyr Gly Ala His Glu Gin Thr His Ala Met Tyr Ala Cys 

1095 1100 1105 

CCC AAA TTA CCA TAC AAC AAG GAG ACA AGC CCT TCT TTC TAC TTT GCC 3417 
Pro Lys Leu Pro Tyr Asn Lys Glu Thr Ser Pro Ser Phe Tyr Phe Ala 

1110 1115 1120 

ATT TCC ACG GGC TCC CTT GCT CAG CAG TAT GCG CAC CCT AAC GCT ACC 3465 
He Ser Thr Gly Ser Leu Ala Gin Gin Tyr Ala His Pro Asn Ala Thr 

1125 1130 1135 

CTG CAC CCA CAT ACT CCA CAC CCT CAG CCT TCA GCT ACC CCC ACT GGA 3513 
Leu His Pro His Thr Pro His Pro Gin Pro Ser Ala Thr Pro Thr Gly 
1140 1145 1150 1155 

CAG CAG CAA AGC CAA CAT GGT GGA AGT CAT CCT GCA CCC AGT CCT GTT 3561 
Gin Gin Gin Ser Gin His Gly Gly Ser His Pro Ala Pro Ser Pro Val 

1160 1165 1170 

CAG CAC CAT CAG CAC CAG GCC GCC CAG GCT CTC CAT CTG GCC AGT CCA 3609 
Gin His His Gin His Gin Ala Ala Gin Ala Leu His Leu Ala Ser Pro 

1175 1180 1185 

CAG. CAG CAG TCA GCC ATT TAC CAC GCG GGG CTT GCG CCA ACT CCA CCC 3657 
Gin Gin Gin Ser Ala He Tyr His Ala Gly Leu Ala Pro Thr Pro Pro 

1190 1195 1200 

TCC ATG ACA CCT GCC TCC AAC ACG CAG TCG CCA CAG AAT AGT TTC CCA 3705 
Ser Met Thr Pro Ala Ser Asn Thr Gin Ser Pro Gin Asn Ser Phe Pro 

1205 1210 1215 

GCA GCA CAA CAG ACT GTC TTT ACG ATC CAT CCT TCT CAC GTT CAG CCG 3753 
Ala Ala Gin Gin Thr Val Phe Thr He His Pro Ser His Val Gin Pro 
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1220 1225 1230 ,1235 

GCG TAT ACC AAC CCA CCC CAC ATG GCC CAC GTA CCT CAG GCT CAT GTA 3801 

Ala Tyr Thr Asn Pro Pro His Met Ala His Val Pro Gin Ala His Val 

1240 1245 1250 

CAG TCA GGA ATG GTT CCT TCT CAT CCA ACT GCC CAT GCG CCA ATG ATG 3849 
Gin Ser Gly Met Val Pro Ser His Pro Thr Ala His Ala Pro Met Met 

1255 1260 1265 

CTA ATG ACG ACA CAG CCA CCC GGC GGT CCC CAG GCC GCC CTC GCT CAA 3897 
Leu Met Thr Thr Gin Pro Pro Gly Gly Pro Gin Ala Ala Leu Ala Gin 

1270 1275 1280 

AGT GCA CTA CAG CCC ATT CCA GTC TCG ACA ACA GCG CAT TTC CCC TAT 3945 ' 
Ser Ala Leu Gin Pro lie Pro Val Ser Thr Thr Ala His Phe Pro Tyr 

1285 1290 1295 

ATG ACG CAC CCT TCA GTA CAA GCC CAC CAC CAA CAG CAG TTG 3987 
Met Thr His Pro Ser Val Gin Ala His His Gin Gin Gin Leu 
1300 1305 1310 

TAAGGCTGCC CTGGAGGAAC CGAAAGGCCA AATTCCCTCC TCCCTTCTAC TGCTTCTACC 4047 
AACTGGAAGC ACAGAAAACT AGAATTTCAT TTATTTTGTT TTTAAAATAT ATATGTTGAT 4107 
TTCTTGTAAC ATCCAATAGG AATGCTAACA GTTCACTTGC AGTGGAAGAT ACTTGGACCG 4167 
AGTAGAGGCA TTTAGGAACT TGGGGGCTAT TCCATAATTC CATATGCTGT TTCAGAGTCC 4227 
CGCAGGTACC CCAGCTCTGC TTGCCGAAAC TGGAAGTTAT TTATTTTTTA ATAACCCTTG 4287 
AAAGTCATGA ACACATCAGC TAGCAAAAGA AGTAACAAGA GTGATTCTTG CTGCTATTAC 4347 
TGCTAAAAAA AAAAAAAAAA 4367 

mi] 
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1 rarccGCACcrcccscTCCcacccTO^ 



1 F-1006 M R S A 

61 gccgcagctcx:tcggagtcccgcggtqqccaccgagtc^ 

5AAAPRSPAVATSSRRFAAAR 

25 WPGWRSLQRPARRSGRGOGG 
181 GCGGCCCCCXGACa^TCCCTCCGCCGCCCCTVCCCC^^ 

A5AAPGPYPSAAPPPPGPGPPP 
241 TCCaax a GAGCTCGCX:TCC C " iVC <JCC''l T^ 

65^KQSSPPSASDCPGSNGN6G 
301 GGaxrGTTTCGGCCaSGCTCCCGGCGGCTCCTTGGTCTCG 

H-xooa 

85GAFRPGS RRLLGLGGPPRPP 

361 GTCGTCGTCCTTCTCCCCCTCGCCAGCCCGGGCGCCCCTCCGGCCGCGCCi^^ 

105 VVVLLPIiASPGAPPAAPTRA 

421 TCCCCGCTCGGCGCCCGTGCGTCCCCGCCGCGTTCCGGCGTCTCCTTGGCGCGCCCGGCT 

125 SPLGARASPPRSGVSLARPA 

481 CCCGGCTGTCCCCGCCCGGCGTGCGAGCCGGTGTATGGGCCCCTCACCATGTC^^ 

F-1 

145 PGCPRPACBPVyCPLTMSLK 

541 CCCCAGCAGCAGCAGCAGCAGCAGCAGCA6CACX:AGCAGCAGC^^ 

165FQQQQQQQQQQQQQQQQQQQ 

601 CAGCAGCAGCAGCCGCCGCCCGCGGCTCXrCAATGTCCXXrAAGCCCGGC^ 

-R-2 



185 QQQQPPPAAAWVRKPGGSGL 

661 ClJU3CGTCCK:CCGCCGCCGCGCCTTCGCCG^lx:CIXlt3at^^^ 
R-1 

205 LASPAAAPSPSSS ^^^^^^^^^^^^^^^^^^^^^^^ 

225 T APSSVVAATSGGGRPGLGR 
781 GGlTOAAACAGTAACAAAGGACTXXrCTCAGTCTACGATT^ 

245 GRNSNKGLPQSTI SFDGIYA 
841 AATATGAGGATGGTrcATATACTTACATCAOTTCTTGGCTCC^ 

265 NMRMVHILTSVVGSKCEVQV 
9 CI AAAAATCGAGGTATATATGAAQGAGTITTTAAAACTTACAGTCC^^ 
285 KNGGIYEGVFKTYSPKCDLV 
961 CTTGATGCXrGCACATGAG/WlAAGTACAaAATCCAGTTC^^ 

305 LDAAHEKSTESSSGPKREEI 
1021 ATGGAGACH!A l ' in ' iXi ' i ' i <:AAATGTTCAGft CVi rr G TO 

325 MESILFKCSDFVVVQPKDMD 
1081 TCCAGTTATGCAAAAAGAGATGCTTTTACTGACTCTGCT 

345 SSYAKRDAFTDSAXSAKVNG 
1141 GAACACAAAGAGAAGGACCTGGAGCCCTGGGATCCAGGTGI^C^^ 

365 EHKEKDLEPWDAGELTANEE 
1201 CTTCAGGCTTTGGAAAATGACGTATCTAATGGATGGGATCCCAATGAT^ 

385 LEALE^TDVSNGWDPNDMFRY 
1261 AATGAAGAA?UiTTATGGTGTAGTGTCTACGTATGATAGCAGTTO 

'405 NEENYGVVSTYDSSLSSYTV 
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1321 CCCTTIUSAAAGAGftTAACTCAGAAG2^ATTTCT 

425 PLEHDNSEEFLSREARAKQL 
1381 GCAGAAGAAATTGAGTCTUUSTGCCCAGTA^ 

445 AEEIESSAQYKARVALEHDD 
1441 AGGACTGAGGAAGAAAAATACACAOCAGTTCAOAf^^ 

465 RSEEBKYTAVQRNSSEREGH 
1501 AGCATAAACACTAGGGAAAAIT^AATATATTCCTCCTCG^ 

485 SXNTRENKYIPPGQRNREVX 
1561 TCCTCGGGAAGTGQGAGACAGAATTCACCGCGTATGGGCX^^ 

505 SWGSGRQKSPRfilGQPGSGSH 
1621 CCATCAAGATCCACTTCTCACACTTCAGATTl^^ 

525 PSRSTSHTSDPNPNSGSDQR 

545 VVKGGVPWPSPCPS PSSRPP 
1741 TCTCGCTACCAGTCAGGTTCCAACTCTCT^ 

565 SRYQSGPNSLPPRAATPTRP 
1801 CCCTCCAQGCCCCX:CTCGCGGCCATCCAGACCCCCGTCTCACCCC^^ 

585 PSRPPSRPSRPPSHFSAHGS 
1861 CCAGCTCCTGTCTCTACTATGCCTAAACGCATGTCTTCAGA 

605 FAPVSTMPKRMS SBGPPRKS 
1921 CCAAAGGCCCAGCXSACATCCTCGAAATCACAGAGri^^ 

625 PKAQRHPRKHRVSAGRGSIS 
1981 AGTGGCCTAGAATrrTGTATCCCACAACCCACCCAGTGAA<^^ 

645 SGLEFVSHNPPSEAATPPVA 
2041 AGGACCAGTCCCTCGGGGGGAAC GTGGTC ATCAGTGGTCAGTGGGGT^ CAAGATTATCC 

665 RTSPSGGTWSSVVSGVPRLS 
2101 CCTAAAACTCJVTAGACCCAGGTCTCCCAGACMAACAGTATTGGAAATJ^^ 

685 PKTHRPRSPBQNSXGNTPSG 
2161 CCAGTTCTTGCTTCTCCCCAAGCTGGTATTATTCCAACT^^ 

705 PVLASPQAGIIPTEAVAKPI 
2221 CCAGCTGCATCTCCTACGCCTGCTAGTCCTGCATCGAAC^ 

725 PAASPTPASPASNRAVTPSS 
2281 GAGGCTAAAGATTCCAGGCTTCAAGATCAGAGGCAGAACTCTC 

745 EAKDSRLQDQRQNSPAGNKB 
2341 AATAOTAAACCCAATGAAACATCACCTAGCTTCTCAAAAGCTGAAAACAAAGCTAT AT^ 

765 NIKPHETSPSFSKA EHKGIS 
2401 CCJ^GTrCTrrrCTGAACATAGAAAACAGATTGATGATT^^ 

785 PVVSEHRKQI DDLKKFKNDF 
2461 AGGTTACAGCCAAOTICTACTTCTGAATCTATGGATC 

805 RLQPSSTSESKDQLI.NKNRE 
2521 GGAGAAAAATCAAGAGATTTGATCAAAGACAAAATTGAACCAAGTGCTAA^ 

825 GEKSKOLIKD RIEPSAKDSF 
2581 ATTGAAAATAGCAGCAGCAACTGTACCAGTGGCAGCAGCAAGCCGAATAG^ 

845 IBNSSSNCTSGSSKPNSPSI 
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2641 TCCCCTTCaAIACTTAGTAACACaGAG^ 

8€5 SPSXLSNTZHKRGPSVTSQG 

2701 (nrCiUSACTTCXZi^CCJUSCAirt;?^^ 

885 VQTSSPACKQBKDDKEBKKD 

2761 GCAiaCTGiJSCiiAGrrV^^ 

905 AAEQVRKSTLNPWAKBrNPR 

2821 TCCTOCroCC/WSCCAAAtSCCTTOTAC^ 

r-xa— ^ 

925 SPSQPKPSTTPTSPRPQAQP 
2881 AGCCCAOCTATGGTGGGTCailCaW^JiGCCAACT^ 

945 SPSMVCHQQPTPVYTQPVCF 
2941 GCAra^TATGATOT2mxaW3TCCCAGTGAGCX:C^^ 

965 APNMMyFVPVSPGVQPI.YPI 
3001 CCTATGACGCCCATGCCAGTCAATCAAGCCAAGACATAT^ 

985 PMTPMPVNQAKTYRAVPNMP 
3061 CA&CAGCGGCAAGACCAQCaTCATCAGAGTGCCATGATGCACCCAGC^^ 
1005 QQRQDQHHQ SA£9HHPASAAG 
3121 CCACCGATTGCAGCCACCCCACCAGCTTACIC^ 

1025 PPIAATPPAYSTQYVAYS PQ 
3181 CAGTTCCCAAATCAC3CCCCTTGTTCA{3CATC^^ 

1045 QFPNQPLVQHVPHYQSQHPH 
3241 LrUXJ ' rA TAGri^:CTGT A A!iaCAGGGTAATGCT&GAATGAT^ GCCC AG 

1065 VYSPVIQGNARMMAPPTHAQ 
3301 COTQCrrTTAGTATCriTCTTCAGC^^ 

1085 pglvsssatqygahbqtham 
3361 takxatgtcccaaattftccatacaacaag^^ 

1105 yacpklpynkets psfyfai 
3421 tcxacgggotcccritoctcagcagtatgcgcac^ 

1125 stgslaqqyahpnatlhpht 

3481 cc&caccctcagccttcagcxacccccactggacagc^ 

1145 phpqpsatptgqqqsqhggs 

1165 hpapspvqhhqhqaaqalhl. 

3601 gcxiagtccacagcagcagtcagccatitaccacgcggggctt^^ 

1185 aspqqqsaiyhaglaptpps 

3661 atgacacctgcctcxj^acacgcagtcgccacagaatagttic^^ 

1205 mtpasktqspqnsfpaaqqt 

3721 gtctttacgatcxatccttctcacgticagccggcgtata^ 

1225 vftihpshvqpaytnpphma 

37 81 cacgtacctcac3gctcatgtacac?ra^ggaatggttcct'^^ 

1245 hvpqahvqsgmvpshptaha 

3841 ccaatgatgctaatgacgacacagccaccx:ggcggtccccaggccg^ 

1265 pmmlmttqppggpqaalaqs 

3901 gcactacagoccattccagtctcgacaacagcgcatttc^ 

1285 alqpipvsttakfpymthps 

3961 gtacaagcccacxiaccaacagcagttgtaaggctgccctgc^ 

1305 vqahhqqql * 



3 



ailE#¥ 09-3100183 



8—304059 



[®4] 



4021 TCCCTCCTCCCTTCTACTGCTTCTACCAACTGGAAGCACAGAAAACTAGAATTT^ 
4081 TTTTGTTTTTAAAATATATATGTTGATTTCTTCTAA^ 
4141 CACrTCX:AGTGGAA(^TACrriX3GACCGAGTAGAGGCAT^ 

4201 ATAATTCCATATGCTGTTTCAGAGTCCCGOVGCn'ACCCCAGCTCTGCTTGCCGJ^ 
4261 AAGTTATTTATTTTTTAATAACCCTTGAAAGTCa^TGAAavC^^ 
4321 AACAAGAGTC»TTCTTGCTGCTATTACTGCT{A) „ 
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